Recurrent neural networks for phoneme recognition
نویسندگان
چکیده
This paper deals with recurrent neural networks of multilayer perceptron type which are well-suited for speech recognition, specially for phoneme recognition. The ability of these networks has been investigated by phoneme recognition experiments using a number of Japanese words uttered by a native male speaker in a quiet environment. Results of the experiments show that recognition rates achieved with these networks are higher than those obtained with conventional non-recurrent neural networks.
منابع مشابه
Continuous Speech Phoneme Recognition Using Dynamic Artificial Neural Networks
Phoneme classification and recognition is the first step to large vocabulary continuous speech recognition. This step represents the acoustic modeling part of such a system. In hybrid speech recognition systems phoneme recognition is made by artificial neural networks (ANN’s). The main objective of this paper is the investigation of dynamic ANN’s, namely the Time-Delay Neural Networks (TDNN) an...
متن کاملAcoustic Models Based on Non-uniform Segments and Bidirectional Recurrent Neural Networks
In this paper a new framework for acoustic model building is presented. It is based on non-uniform segment models, which are learned and scored with a time bidirectional recurrent neural network. While usually neural networks in speech recognition systems are used to estimate posterior "frame to phoneme" probabilities, they are used here to estimate directly "segment to phoneme" probabilities, ...
متن کاملRecurrent Neural Networks Design by Means of Multi-Objective Genetic Algorithm Case study : Phoneme Recognition
Evolutionary algorithms are considered more efficient for optimal system design because they can provide higher opportunity for obtaining the global optimal solution. This paper introduces a method for construct and train Recurrent Neural Networks (RNN) by means of Multi-Objective Genetic Algorithms (MOGA). The use of a multi-objective evolutionary algorithm allows the definition of many object...
متن کاملPhoneme Probability Estimation with Dynamic Sparsely Connected Artificial Neural Networks
This paper presents new methods for training large neural networks for phoneme probability estimation. An architecture combining time-delay windows and recurrent connections is used to capture the important dynamic information of the speech signal. Because the number of connections in a fully connected recurrent network grows super-linear with the number of hidden units, schemes for sparse conn...
متن کاملNew variant of the Self Organizing Map in Pulsed Neural Networks to Improve Phoneme Recognition in Continuous Speech
Speech recognition has gradually improved over the years, phoneme recognition in particular. Phoneme recognition plays very important role in speech processing. Phoneme strings are basic representation for automatic language recognition and it is proved that language recognition results are highly correlated with phoneme recognition results. Nowadays, many recognizers are based on Artificial ne...
متن کامل